Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 65280 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 16.5 MiB |
| Average record size in memory | 265.7 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 4 |
| DateTime | 1 |
Item has a high cardinality: 657 distinct values | High cardinality |
df_index is highly correlated with Invoice_Quarter and 1 other fields | High correlation |
Invoice_Quarter is highly correlated with df_index and 1 other fields | High correlation |
Invoice_Month is highly correlated with df_index and 1 other fields | High correlation |
Sales Quantity is highly correlated with List Price and 1 other fields | High correlation |
Sales Amount is highly correlated with Sales Amount Based on List Price and 3 other fields | High correlation |
Sales Amount Based on List Price is highly correlated with Sales Amount and 3 other fields | High correlation |
Discount Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Margin Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Cost Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
List Price is highly correlated with Sales Quantity and 1 other fields | High correlation |
Sales Price is highly correlated with Sales Quantity and 1 other fields | High correlation |
df_index is highly correlated with Invoice_Quarter and 1 other fields | High correlation |
Invoice_Quarter is highly correlated with df_index and 1 other fields | High correlation |
Invoice_Month is highly correlated with df_index and 1 other fields | High correlation |
Sales Quantity is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Amount is highly correlated with Sales Quantity and 3 other fields | High correlation |
Sales Amount Based on List Price is highly correlated with Sales Quantity and 4 other fields | High correlation |
Discount Amount is highly correlated with Sales Amount Based on List Price | High correlation |
Sales Margin Amount is highly correlated with Sales Quantity and 3 other fields | High correlation |
Sales Cost Amount is highly correlated with Sales Quantity and 3 other fields | High correlation |
List Price is highly correlated with Sales Price | High correlation |
Sales Price is highly correlated with List Price | High correlation |
df_index is highly correlated with Invoice_Quarter and 1 other fields | High correlation |
Invoice_Quarter is highly correlated with df_index and 1 other fields | High correlation |
Invoice_Month is highly correlated with df_index and 1 other fields | High correlation |
Sales Amount is highly correlated with Sales Amount Based on List Price and 3 other fields | High correlation |
Sales Amount Based on List Price is highly correlated with Sales Amount and 3 other fields | High correlation |
Discount Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Margin Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Cost Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
List Price is highly correlated with Sales Price | High correlation |
Sales Price is highly correlated with List Price | High correlation |
df_index is highly correlated with Invoice_Year and 2 other fields | High correlation |
CustKey is highly correlated with Sales Rep | High correlation |
Invoice_Year is highly correlated with df_index and 1 other fields | High correlation |
Invoice_Quarter is highly correlated with df_index and 1 other fields | High correlation |
Invoice_Month is highly correlated with df_index and 2 other fields | High correlation |
Sales Quantity is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Amount is highly correlated with Sales Quantity and 4 other fields | High correlation |
Sales Amount Based on List Price is highly correlated with Sales Quantity and 4 other fields | High correlation |
Discount Amount is highly correlated with Sales Amount and 3 other fields | High correlation |
Sales Margin Amount is highly correlated with Sales Quantity and 4 other fields | High correlation |
Sales Cost Amount is highly correlated with Sales Quantity and 4 other fields | High correlation |
Sales Rep is highly correlated with CustKey | High correlation |
List Price is highly correlated with Sales Price | High correlation |
Sales Price is highly correlated with List Price | High correlation |
Sales Quantity is highly skewed (γ1 = 23.00722057) | Skewed |
Sales Cost Amount is highly skewed (γ1 = 21.01063149) | Skewed |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
Discount Amount has 1214 (1.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-06 03:19:54.147763 |
|---|---|
| Analysis finished | 2022-10-06 03:20:20.926352 |
| Duration | 26.78 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 65280 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32640.96425 |
| Minimum | 0 |
|---|---|
| Maximum | 65281 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3264.95 |
| Q1 | 16320.75 |
| median | 32640.5 |
| Q3 | 48961.25 |
| 95-th percentile | 62017.05 |
| Maximum | 65281 |
| Range | 65281 |
| Interquartile range (IQR) | 32640.5 |
Descriptive statistics
| Standard deviation | 18845.29037 |
|---|---|
| Coefficient of variation (CV) | 0.5773509086 |
| Kurtosis | -1.200026217 |
| Mean | 32640.96425 |
| Median Absolute Deviation (MAD) | 16320.5 |
| Skewness | 5.038305807 × 10-6 |
| Sum | 2130802146 |
| Variance | 355144969 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 43527 | 1 | < 0.1% |
| 43514 | 1 | < 0.1% |
| 43515 | 1 | < 0.1% |
| 43516 | 1 | < 0.1% |
| 43517 | 1 | < 0.1% |
| 43518 | 1 | < 0.1% |
| 43519 | 1 | < 0.1% |
| 43520 | 1 | < 0.1% |
| 43521 | 1 | < 0.1% |
| Other values (65270) | 65270 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 65281 | 1 | |
| 65280 | 1 | |
| 65279 | 1 | |
| 65278 | 1 | |
| 65277 | 1 | |
| 65276 | 1 | |
| 65275 | 1 | |
| 65274 | 1 | |
| 65273 | 1 | |
| 65272 | 1 |
| Distinct | 615 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10017702.67 |
| Minimum | 10000453 |
|---|---|
| Maximum | 10027583 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 10000453 |
|---|---|
| 5-th percentile | 10002506 |
| Q1 | 10012715 |
| median | 10019665 |
| Q3 | 10023511 |
| 95-th percentile | 10026006.15 |
| Maximum | 10027583 |
| Range | 27130 |
| Interquartile range (IQR) | 10796 |
Descriptive statistics
| Standard deviation | 7176.243993 |
|---|---|
| Coefficient of variation (CV) | 0.0007163562571 |
| Kurtosis | -0.3714057462 |
| Mean | 10017702.67 |
| Median Absolute Deviation (MAD) | 4886 |
| Skewness | -0.7701959384 |
| Sum | 6.539556306 × 1011 |
| Variance | 51498477.84 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10025919 | 2760 | 4.2% |
| 10019194 | 2752 | 4.2% |
| 10012715 | 1431 | 2.2% |
| 10012226 | 1389 | 2.1% |
| 10025025 | 1143 | 1.8% |
| 10023524 | 1042 | 1.6% |
| 10020515 | 1010 | 1.5% |
| 10017638 | 792 | 1.2% |
| 10022456 | 741 | 1.1% |
| 10002506 | 714 | 1.1% |
| Other values (605) | 51506 |
| Value | Count | Frequency (%) |
| 10000453 | 329 | |
| 10000455 | 19 | < 0.1% |
| 10000456 | 104 | 0.2% |
| 10000457 | 19 | < 0.1% |
| 10000458 | 10 | < 0.1% |
| 10000460 | 120 | 0.2% |
| 10000461 | 251 | |
| 10000462 | 3 | < 0.1% |
| 10000466 | 123 | 0.2% |
| 10000469 | 162 |
| Value | Count | Frequency (%) |
| 10027583 | 25 | < 0.1% |
| 10027575 | 5 | < 0.1% |
| 10027572 | 52 | 0.1% |
| 10027560 | 42 | 0.1% |
| 10027381 | 108 | |
| 10027370 | 235 | |
| 10027356 | 21 | < 0.1% |
| 10027348 | 14 | < 0.1% |
| 10027340 | 35 | 0.1% |
| 10027119 | 176 |
| Distinct | 657 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.9 MiB |
| Better Fancy Canned Sardines | 1648 |
|---|---|
| Ebony Prepared Salad | 1471 |
| Moms Sliced Turkey | 1192 |
| Imagine Popsicles | 1191 |
| Discover Manicotti | 1126 |
| Other values (652) |
Length
| Max length | 37 |
|---|---|
| Median length | 32 |
| Mean length | 21.72234988 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1418035 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Urban Large Eggs |
|---|---|
| 2nd row | Moms Sliced Turkey |
| 3rd row | Cutting Edge Foot-Long Hot Dogs |
| 4th row | Kiwi Lox |
| 5th row | High Top Sweet Onion |
Common Values
| Value | Count | Frequency (%) |
| Better Fancy Canned Sardines | 1648 | 2.5% |
| Ebony Prepared Salad | 1471 | 2.3% |
| Moms Sliced Turkey | 1192 | 1.8% |
| Imagine Popsicles | 1191 | 1.8% |
| Discover Manicotti | 1126 | 1.7% |
| Red Spade Foot-Long Hot Dogs | 1075 | 1.6% |
| High Top Dried Mushrooms | 1073 | 1.6% |
| Big Time Frozen Cheese Pizza | 947 | 1.5% |
| Cutting Edge Foot-Long Hot Dogs | 942 | 1.4% |
| Bravo Large Canned Shrimp | 941 | 1.4% |
| Other values (647) | 53674 |
Length
| Value | Count | Frequency (%) |
| canned | 6378 | 2.8% |
| ebony | 5460 | 2.4% |
| cheese | 5194 | 2.3% |
| better | 4570 | 2.0% |
| red | 4271 | 1.9% |
| top | 4173 | 1.8% |
| spade | 4161 | 1.8% |
| high | 4138 | 1.8% |
| best | 3480 | 1.5% |
| nationeel | 3328 | 1.4% |
| Other values (294) | 184532 |
Most occurring characters
| Value | Count | Frequency (%) |
| 164405 | 11.6% | |
| e | 147096 | 10.4% |
| o | 92465 | 6.5% |
| a | 92274 | 6.5% |
| n | 74072 | 5.2% |
| i | 69458 | 4.9% |
| t | 68731 | 4.8% |
| r | 67351 | 4.7% |
| l | 59835 | 4.2% |
| s | 57796 | 4.1% |
| Other values (46) | 524552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1013733 | |
| Uppercase Letter | 236241 | 16.7% |
| Space Separator | 164405 | 11.6% |
| Dash Punctuation | 2160 | 0.2% |
| Other Punctuation | 748 | 0.1% |
| Decimal Number | 748 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 147096 | |
| o | 92465 | 9.1% |
| a | 92274 | 9.1% |
| n | 74072 | 7.3% |
| i | 69458 | 6.9% |
| t | 68731 | 6.8% |
| r | 67351 | 6.6% |
| l | 59835 | 5.9% |
| s | 57796 | 5.7% |
| d | 40545 | 4.0% |
| Other values (16) | 244110 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 31428 | |
| C | 30873 | |
| S | 26130 | |
| T | 19374 | 8.2% |
| F | 16239 | 6.9% |
| M | 12520 | 5.3% |
| P | 11469 | 4.9% |
| L | 11022 | 4.7% |
| D | 10754 | 4.6% |
| E | 10237 | 4.3% |
| Other values (15) | 56195 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 579 | |
| 2 | 169 | 22.6% |
Space Separator
| Value | Count | Frequency (%) |
| 164405 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2160 |
Other Punctuation
| Value | Count | Frequency (%) |
| % | 748 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1249974 | |
| Common | 168061 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 147096 | 11.8% |
| o | 92465 | 7.4% |
| a | 92274 | 7.4% |
| n | 74072 | 5.9% |
| i | 69458 | 5.6% |
| t | 68731 | 5.5% |
| r | 67351 | 5.4% |
| l | 59835 | 4.8% |
| s | 57796 | 4.6% |
| d | 40545 | 3.2% |
| Other values (41) | 480351 |
Common
| Value | Count | Frequency (%) |
| 164405 | ||
| - | 2160 | 1.3% |
| % | 748 | 0.4% |
| 1 | 579 | 0.3% |
| 2 | 169 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1418035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 164405 | 11.6% | |
| e | 147096 | 10.4% |
| o | 92465 | 6.5% |
| a | 92274 | 6.5% |
| n | 74072 | 5.2% |
| i | 69458 | 4.9% |
| t | 68731 | 4.8% |
| r | 67351 | 4.7% |
| l | 59835 | 4.2% |
| s | 57796 | 4.1% |
| Other values (46) | 524552 |
Invoice Date
Date
| Distinct | 559 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 510.1 KiB |
| Minimum | 2017-01-01 00:00:00 |
|---|---|
| Maximum | 2019-12-31 00:00:00 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 MiB |
| 2017 | |
|---|---|
| 2019 | |
| 2018 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 261120 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017 |
|---|---|
| 2nd row | 2017 |
| 3rd row | 2017 |
| 4th row | 2017 |
| 5th row | 2017 |
Common Values
| Value | Count | Frequency (%) |
| 2017 | 30573 | |
| 2019 | 28021 | |
| 2018 | 6686 | 10.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2017 | 30573 | |
| 2019 | 28021 | |
| 2018 | 6686 | 10.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 65280 | |
| 0 | 65280 | |
| 1 | 65280 | |
| 7 | 30573 | |
| 9 | 28021 | |
| 8 | 6686 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 261120 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 65280 | |
| 0 | 65280 | |
| 1 | 65280 | |
| 7 | 30573 | |
| 9 | 28021 | |
| 8 | 6686 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 261120 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 65280 | |
| 0 | 65280 | |
| 1 | 65280 | |
| 7 | 30573 | |
| 9 | 28021 | |
| 8 | 6686 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 261120 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 65280 | |
| 0 | 65280 | |
| 1 | 65280 | |
| 7 | 30573 | |
| 9 | 28021 | |
| 8 | 6686 | 2.6% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 1 | |
|---|---|
| 4 | |
| 3 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 65280 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 4 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 65280 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 65280 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 19930 | |
| 4 | 16142 | |
| 3 | 14688 | |
| 2 | 14520 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.307000613 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.563557849 |
|---|---|
| Coefficient of variation (CV) | 0.5650162522 |
| Kurtosis | -1.304080373 |
| Mean | 6.307000613 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.07649377464 |
| Sum | 411721 |
| Variance | 12.69894454 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 7308 | |
| 2 | 6556 | |
| 1 | 6066 | |
| 12 | 5645 | |
| 9 | 5555 | |
| 6 | 5376 | |
| 10 | 5250 | |
| 11 | 5247 | |
| 5 | 5167 | |
| 8 | 4737 | |
| Other values (2) | 8373 |
| Value | Count | Frequency (%) |
| 1 | 6066 | |
| 2 | 6556 | |
| 3 | 7308 | |
| 4 | 3977 | |
| 5 | 5167 | |
| 6 | 5376 | |
| 7 | 4396 | |
| 8 | 4737 | |
| 9 | 5555 | |
| 10 | 5250 |
| Value | Count | Frequency (%) |
| 12 | 5645 | |
| 11 | 5247 | |
| 10 | 5250 | |
| 9 | 5555 | |
| 8 | 4737 | |
| 7 | 4396 | |
| 6 | 5376 | |
| 5 | 5167 | |
| 4 | 3977 | |
| 3 | 7308 |
Invoice_Day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.15589767 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 16 |
| Q3 | 24 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.795337539 |
|---|---|
| Coefficient of variation (CV) | 0.5444041376 |
| Kurtosis | -1.224628553 |
| Mean | 16.15589767 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.02103293504 |
| Sum | 1054657 |
| Variance | 77.35796243 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 2879 | 4.4% |
| 29 | 2831 | 4.3% |
| 10 | 2571 | 3.9% |
| 18 | 2487 | 3.8% |
| 26 | 2436 | 3.7% |
| 23 | 2396 | 3.7% |
| 21 | 2334 | 3.6% |
| 30 | 2333 | 3.6% |
| 9 | 2271 | 3.5% |
| 5 | 2268 | 3.5% |
| Other values (21) | 40474 |
| Value | Count | Frequency (%) |
| 1 | 1850 | |
| 2 | 1671 | |
| 3 | 2001 | |
| 4 | 1701 | |
| 5 | 2268 | |
| 6 | 2115 | |
| 7 | 2185 | |
| 8 | 2245 | |
| 9 | 2271 | |
| 10 | 2571 |
| Value | Count | Frequency (%) |
| 31 | 1019 | 1.6% |
| 30 | 2333 | |
| 29 | 2831 | |
| 28 | 2093 | |
| 27 | 2140 | |
| 26 | 2436 | |
| 25 | 2123 | |
| 24 | 2065 | |
| 23 | 2396 | |
| 22 | 2220 |
| Distinct | 279 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.08570772 |
| Minimum | 1 |
|---|---|
| Maximum | 16000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 86 |
| Maximum | 16000 |
| Range | 15999 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 429.6683008 |
|---|---|
| Coefficient of variation (CV) | 9.530033408 |
| Kurtosis | 649.737599 |
| Mean | 45.08570772 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 23.00722057 |
| Sum | 2943195 |
| Variance | 184614.8487 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 15264 | |
| 2 | 13466 | |
| 3 | 7056 | |
| 4 | 4973 | 7.6% |
| 5 | 3519 | 5.4% |
| 6 | 3061 | 4.7% |
| 10 | 2596 | 4.0% |
| 8 | 1460 | 2.2% |
| 12 | 1314 | 2.0% |
| 20 | 1034 | 1.6% |
| Other values (269) | 11537 |
| Value | Count | Frequency (%) |
| 1 | 15264 | |
| 2 | 13466 | |
| 3 | 7056 | |
| 4 | 4973 | 7.6% |
| 5 | 3519 | 5.4% |
| 6 | 3061 | 4.7% |
| 7 | 711 | 1.1% |
| 8 | 1460 | 2.2% |
| 9 | 453 | 0.7% |
| 10 | 2596 | 4.0% |
| Value | Count | Frequency (%) |
| 16000 | 11 | < 0.1% |
| 13600 | 12 | < 0.1% |
| 9504 | 7 | < 0.1% |
| 8316 | 40 | |
| 7128 | 21 | |
| 7126 | 2 | < 0.1% |
| 6480 | 2 | < 0.1% |
| 6400 | 4 | < 0.1% |
| 5834 | 4 | < 0.1% |
| 4752 | 13 | < 0.1% |
| Distinct | 17895 |
|---|---|
| Distinct (%) | 27.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2852.043002 |
| Minimum | 200.01 |
|---|---|
| Maximum | 555376 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 200.01 |
|---|---|
| 5-th percentile | 215.78 |
| Q1 | 308.38 |
| median | 553.94 |
| Q3 | 1279.9875 |
| 95-th percentile | 8777.79 |
| Maximum | 555376 |
| Range | 555175.99 |
| Interquartile range (IQR) | 971.6075 |
Descriptive statistics
| Standard deviation | 15164.56904 |
|---|---|
| Coefficient of variation (CV) | 5.3170899 |
| Kurtosis | 478.9000606 |
| Mean | 2852.043002 |
| Median Absolute Deviation (MAD) | 292.92 |
| Skewness | 18.57841346 |
| Sum | 186181367.2 |
| Variance | 229964154.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 784.97 | 115 | 0.2% |
| 817.68 | 115 | 0.2% |
| 294.72 | 110 | 0.2% |
| 307 | 104 | 0.2% |
| 597.14 | 102 | 0.2% |
| 622.02 | 101 | 0.2% |
| 824.39 | 100 | 0.2% |
| 791.41 | 99 | 0.2% |
| 401.16 | 95 | 0.1% |
| 204.66 | 92 | 0.1% |
| Other values (17885) | 64247 |
| Value | Count | Frequency (%) |
| 200.01 | 6 | |
| 200.06 | 6 | |
| 200.08 | 1 | < 0.1% |
| 200.14 | 3 | |
| 200.15 | 5 | |
| 200.19 | 7 | |
| 200.21 | 1 | < 0.1% |
| 200.3 | 3 | |
| 200.36 | 1 | < 0.1% |
| 200.37 | 6 |
| Value | Count | Frequency (%) |
| 555376 | 1 | < 0.1% |
| 539200 | 5 | |
| 517632 | 5 | |
| 472069.6 | 2 | < 0.1% |
| 458320 | 5 | |
| 439987.2 | 5 | |
| 310156.07 | 1 | < 0.1% |
| 301122.4 | 2 | < 0.1% |
| 297240 | 1 | < 0.1% |
| 289077.5 | 2 | < 0.1% |
Sales Amount Based on List Price
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 4060 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4707.617837 |
| Minimum | 0 |
|---|---|
| Maximum | 632610.16 |
| Zeros | 294 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 390 |
| Q1 | 561.04 |
| median | 998.16 |
| Q3 | 2316.63 |
| 95-th percentile | 16425.12 |
| Maximum | 632610.16 |
| Range | 632610.16 |
| Interquartile range (IQR) | 1755.59 |
Descriptive statistics
| Standard deviation | 20696.74443 |
|---|---|
| Coefficient of variation (CV) | 4.396436827 |
| Kurtosis | 278.7073688 |
| Mean | 4707.617837 |
| Median Absolute Deviation (MAD) | 524.88 |
| Skewness | 14.07462245 |
| Sum | 307313292.4 |
| Variance | 428355229.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1431.23 | 590 | 0.9% |
| 1627.84 | 530 | 0.8% |
| 803.86 | 498 | 0.8% |
| 596 | 448 | 0.7% |
| 1254.1899 | 418 | 0.6% |
| 966.44 | 376 | 0.6% |
| 439.7 | 372 | 0.6% |
| 507.75 | 363 | 0.6% |
| 767.75 | 348 | 0.5% |
| 939.57 | 343 | 0.5% |
| Other values (4050) | 60994 |
| Value | Count | Frequency (%) |
| 0 | 294 | |
| 194 | 2 | < 0.1% |
| 195.61 | 1 | < 0.1% |
| 198.396 | 1 | < 0.1% |
| 198.63 | 1 | < 0.1% |
| 200.7 | 8 | < 0.1% |
| 200.8 | 1 | < 0.1% |
| 201.69 | 3 | < 0.1% |
| 202.14 | 1 | < 0.1% |
| 202.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 632610.16 | 5 | |
| 624453.75 | 2 | < 0.1% |
| 539200 | 11 | |
| 458320 | 12 | |
| 391924.7232 | 5 | |
| 387395 | 8 | |
| 348655.5 | 2 | < 0.1% |
| 332196.405 | 2 | < 0.1% |
| 330708.3792 | 5 | |
| 310273.7392 | 2 | < 0.1% |
Discount Amount
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 17820 |
|---|---|
| Distinct (%) | 27.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1855.574835 |
| Minimum | -255820.8 |
|---|---|
| Maximum | 343532.66 |
| Zeros | 1214 |
| Zeros (%) | 1.9% |
| Negative | 972 |
| Negative (%) | 1.5% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | -255820.8 |
|---|---|
| 5-th percentile | 18.68 |
| Q1 | 246.0375 |
| median | 441.76 |
| Q3 | 999.76 |
| 95-th percentile | 6353 |
| Maximum | 343532.66 |
| Range | 599353.46 |
| Interquartile range (IQR) | 753.7225 |
Descriptive statistics
| Standard deviation | 9037.140888 |
|---|---|
| Coefficient of variation (CV) | 4.870264847 |
| Kurtosis | 379.7363588 |
| Mean | 1855.574835 |
| Median Absolute Deviation (MAD) | 233.935 |
| Skewness | 10.84177856 |
| Sum | 121131925.2 |
| Variance | 81669915.44 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1214 | 1.9% |
| 24.88 | 103 | 0.2% |
| 606.84 | 100 | 0.2% |
| 639.82 | 97 | 0.1% |
| 601.9033 | 93 | 0.1% |
| 402.7 | 93 | 0.1% |
| 634.6133 | 93 | 0.1% |
| 918.1412 | 88 | 0.1% |
| 169.36 | 87 | 0.1% |
| 385.98 | 87 | 0.1% |
| Other values (17810) | 63225 |
| Value | Count | Frequency (%) |
| -255820.8 | 1 | < 0.1% |
| -245587.97 | 1 | < 0.1% |
| -238792.73 | 1 | < 0.1% |
| -231837.6 | 3 | |
| -222564.1 | 3 | |
| -127176 | 1 | < 0.1% |
| -122088.96 | 1 | < 0.1% |
| -84573.72 | 1 | < 0.1% |
| -81190.77 | 1 | < 0.1% |
| -53626 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 343532.66 | 2 | |
| 339103.35 | 1 | < 0.1% |
| 331487.76 | 2 | |
| 327213.75 | 1 | < 0.1% |
| 322454.09 | 1 | < 0.1% |
| 210371 | 4 | |
| 202995 | 4 | |
| 191196.5532 | 2 | |
| 189333.9 | 1 | < 0.1% |
| 182832.8832 | 2 |
Sales Margin Amount
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 21295 |
|---|---|
| Distinct (%) | 32.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1191.012887 |
| Minimum | -3932.93 |
|---|---|
| Maximum | 188800 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 576 |
| Negative (%) | 0.9% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | -3932.93 |
|---|---|
| 5-th percentile | 61.54 |
| Q1 | 129.9475 |
| median | 246.49 |
| Q3 | 579.39 |
| 95-th percentile | 3824.43 |
| Maximum | 188800 |
| Range | 192732.93 |
| Interquartile range (IQR) | 449.4425 |
Descriptive statistics
| Standard deviation | 5860.857507 |
|---|---|
| Coefficient of variation (CV) | 4.920901841 |
| Kurtosis | 324.9276471 |
| Mean | 1191.012887 |
| Median Absolute Deviation (MAD) | 140.265 |
| Skewness | 15.57141451 |
| Sum | 77749321.25 |
| Variance | 34349650.72 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 374.7 | 93 | 0.1% |
| 5317.17 | 88 | 0.1% |
| 6235.31 | 87 | 0.1% |
| 341.72 | 84 | 0.1% |
| 15.32 | 69 | 0.1% |
| 37.08 | 67 | 0.1% |
| 52.8 | 67 | 0.1% |
| 431.88 | 64 | 0.1% |
| 464.59 | 64 | 0.1% |
| 24.53 | 63 | 0.1% |
| Other values (21285) | 64534 |
| Value | Count | Frequency (%) |
| -3932.93 | 1 | |
| -3764.4 | 2 | |
| -3673.68 | 2 | |
| -3608.81 | 1 | |
| -3414.01 | 2 | |
| -3132.65 | 2 | |
| -2533.97 | 2 | |
| -2508.21 | 2 | |
| -2488.89 | 1 | |
| -2103.04 | 2 |
| Value | Count | Frequency (%) |
| 188800 | 1 | < 0.1% |
| 185907.2 | 2 | |
| 172624 | 3 | |
| 164339.2 | 2 | |
| 160480 | 2 | |
| 156773.4 | 1 | < 0.1% |
| 156521.04 | 1 | < 0.1% |
| 151056 | 3 | |
| 148401.6 | 3 | |
| 147487.37 | 2 |
Sales Cost Amount
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 5513 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1661.030116 |
| Minimum | 0 |
|---|---|
| Maximum | 366576 |
| Zeros | 347 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 85.36 |
| Q1 | 167.79 |
| median | 304.53 |
| Q3 | 687.4 |
| 95-th percentile | 4946.11 |
| Maximum | 366576 |
| Range | 366576 |
| Interquartile range (IQR) | 519.61 |
Descriptive statistics
| Standard deviation | 9556.62722 |
|---|---|
| Coefficient of variation (CV) | 5.753434047 |
| Kurtosis | 614.2579832 |
| Mean | 1661.030116 |
| Median Absolute Deviation (MAD) | 171.22 |
| Skewness | 21.01063149 |
| Sum | 108432045.9 |
| Variance | 91329123.83 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 449.69 | 534 | 0.8% |
| 475.75 | 457 | 0.7% |
| 0 | 347 | 0.5% |
| 134.67 | 305 | 0.5% |
| 162.89 | 289 | 0.4% |
| 205.72 | 253 | 0.4% |
| 159.14 | 242 | 0.4% |
| 16718.08 | 234 | 0.4% |
| 546.44 | 231 | 0.4% |
| 344.28 | 229 | 0.4% |
| Other values (5503) | 62159 |
| Value | Count | Frequency (%) |
| 0 | 347 | |
| 12.97 | 2 | < 0.1% |
| 19.55 | 4 | < 0.1% |
| 20.8 | 6 | < 0.1% |
| 26 | 1 | < 0.1% |
| 31.19 | 4 | < 0.1% |
| 33.97 | 3 | < 0.1% |
| 35.48 | 2 | < 0.1% |
| 35.54 | 5 | < 0.1% |
| 36.03 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 366576 | 7 | < 0.1% |
| 353292.8 | 4 | < 0.1% |
| 311589.6 | 12 | < 0.1% |
| 185048.85 | 2 | < 0.1% |
| 161446.35 | 5 | < 0.1% |
| 157412.85 | 2 | < 0.1% |
| 153635.03 | 5 | < 0.1% |
| 146630.4 | 4 | < 0.1% |
| 141265.56 | 36 | |
| 137736.24 | 4 | < 0.1% |
| Distinct | 64 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.4231924 |
| Minimum | 103 |
|---|---|
| Maximum | 185 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 103 |
|---|---|
| 5-th percentile | 104 |
| Q1 | 113 |
| median | 134 |
| Q3 | 160 |
| 95-th percentile | 180 |
| Maximum | 185 |
| Range | 82 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 26.64392588 |
|---|---|
| Coefficient of variation (CV) | 0.1938823092 |
| Kurtosis | -1.301510627 |
| Mean | 137.4231924 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.3506954731 |
| Sum | 8970986 |
| Variance | 709.8987865 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 108 | 6225 | 9.5% |
| 180 | 4427 | 6.8% |
| 143 | 2926 | 4.5% |
| 117 | 2442 | 3.7% |
| 103 | 2162 | 3.3% |
| 104 | 2065 | 3.2% |
| 134 | 2033 | 3.1% |
| 115 | 1988 | 3.0% |
| 125 | 1967 | 3.0% |
| 157 | 1744 | 2.7% |
| Other values (54) | 37301 |
| Value | Count | Frequency (%) |
| 103 | 2162 | 3.3% |
| 104 | 2065 | 3.2% |
| 105 | 1184 | 1.8% |
| 107 | 1304 | 2.0% |
| 108 | 6225 | |
| 109 | 1137 | 1.7% |
| 110 | 594 | 0.9% |
| 111 | 542 | 0.8% |
| 112 | 486 | 0.7% |
| 113 | 1422 | 2.2% |
| Value | Count | Frequency (%) |
| 185 | 538 | 0.8% |
| 184 | 229 | 0.4% |
| 183 | 326 | 0.5% |
| 182 | 808 | 1.2% |
| 181 | 792 | 1.2% |
| 180 | 4427 | |
| 179 | 875 | 1.3% |
| 176 | 1083 | 1.7% |
| 175 | 1322 | 2.0% |
| 173 | 795 | 1.2% |
U/M
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.7 MiB |
| EA | |
|---|---|
| SE | 5629 |
| PR | 659 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 130560 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EA |
|---|---|
| 2nd row | EA |
| 3rd row | EA |
| 4th row | EA |
| 5th row | SE |
Common Values
| Value | Count | Frequency (%) |
| EA | 58992 | |
| SE | 5629 | 8.6% |
| PR | 659 | 1.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| ea | 58992 | |
| se | 5629 | 8.6% |
| pr | 659 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 64621 | |
| A | 58992 | |
| S | 5629 | 4.3% |
| P | 659 | 0.5% |
| R | 659 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 130560 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 64621 | |
| A | 58992 | |
| S | 5629 | 4.3% |
| P | 659 | 0.5% |
| R | 659 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130560 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 64621 | |
| A | 58992 | |
| S | 5629 | 4.3% |
| P | 659 | 0.5% |
| R | 659 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 130560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 64621 | |
| A | 58992 | |
| S | 5629 | 4.3% |
| P | 659 | 0.5% |
| R | 659 | 0.5% |
| Distinct | 1062 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 514.7091493 |
| Minimum | 0 |
|---|---|
| Maximum | 2760.7 |
| Zeros | 294 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 36.69 |
| Q1 | 181.56 |
| median | 325.19 |
| Q3 | 803.86 |
| 95-th percentile | 1431.23 |
| Maximum | 2760.7 |
| Range | 2760.7 |
| Interquartile range (IQR) | 622.3 |
Descriptive statistics
| Standard deviation | 449.1870286 |
|---|---|
| Coefficient of variation (CV) | 0.8727006878 |
| Kurtosis | 0.01247467261 |
| Mean | 514.7091493 |
| Median Absolute Deviation (MAD) | 217.35 |
| Skewness | 1.0054526 |
| Sum | 33600213.26 |
| Variance | 201768.9867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 298 | 1508 | 2.3% |
| 1431.23 | 1426 | 2.2% |
| 966.44 | 1192 | 1.8% |
| 1275.1 | 1126 | 1.7% |
| 192.34 | 1041 | 1.6% |
| 1627.84 | 1035 | 1.6% |
| 157.76 | 988 | 1.5% |
| 1084.61 | 975 | 1.5% |
| 181.44 | 893 | 1.4% |
| 412.03 | 892 | 1.4% |
| Other values (1052) | 54204 |
| Value | Count | Frequency (%) |
| 0 | 294 | |
| 0.3929 | 150 | |
| 0.4 | 21 | < 0.1% |
| 0.405 | 25 | < 0.1% |
| 0.41 | 10 | < 0.1% |
| 0.445 | 6 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.61 | 4 | < 0.1% |
| 1.6236 | 2 | < 0.1% |
| 1.8711 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 2760.7 | 12 | < 0.1% |
| 2291.4 | 7 | < 0.1% |
| 2267 | 10 | < 0.1% |
| 1975 | 113 | |
| 1920 | 61 | |
| 1880 | 19 | < 0.1% |
| 1759.4 | 45 | 0.1% |
| 1731.4 | 35 | 0.1% |
| 1691.4 | 12 | < 0.1% |
| 1688.13 | 150 |
| Distinct | 14788 |
|---|---|
| Distinct (%) | 22.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 283.6968508 |
| Minimum | 0.3373411765 |
|---|---|
| Maximum | 6035 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 510.1 KiB |
Quantile statistics
| Minimum | 0.3373411765 |
|---|---|
| 5-th percentile | 22.42555556 |
| Q1 | 100.07 |
| median | 183.75825 |
| Q3 | 448.22 |
| 95-th percentile | 789.66725 |
| Maximum | 6035 |
| Range | 6034.662659 |
| Interquartile range (IQR) | 348.15 |
Descriptive statistics
| Standard deviation | 252.0316598 |
|---|---|
| Coefficient of variation (CV) | 0.8883837061 |
| Kurtosis | 6.882532688 |
| Mean | 283.6968508 |
| Median Absolute Deviation (MAD) | 116.47175 |
| Skewness | 1.418975272 |
| Sum | 18519730.42 |
| Variance | 63519.95752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 140.43 | 191 | 0.3% |
| 817.68 | 189 | 0.3% |
| 133.41 | 181 | 0.3% |
| 783.17 | 138 | 0.2% |
| 824.39 | 138 | 0.2% |
| 23.47 | 136 | 0.2% |
| 82.87333333 | 125 | 0.2% |
| 221.04 | 120 | 0.2% |
| 230.25 | 120 | 0.2% |
| 230.98 | 120 | 0.2% |
| Other values (14778) | 63822 |
| Value | Count | Frequency (%) |
| 0.3373411765 | 2 | < 0.1% |
| 0.3514 | 1 | < 0.1% |
| 0.3619411765 | 1 | < 0.1% |
| 0.37718 | 67 | |
| 0.384 | 9 | < 0.1% |
| 0.3888 | 12 | < 0.1% |
| 0.3929 | 67 | |
| 0.3936 | 5 | < 0.1% |
| 0.4 | 9 | < 0.1% |
| 0.40469 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 6035 | 1 | |
| 3748 | 2 | |
| 3233.36 | 1 | |
| 3009.86 | 1 | |
| 3003.41 | 1 | |
| 2823 | 1 | |
| 2753.32 | 1 | |
| 2560 | 1 | |
| 2540.17 | 1 | |
| 2360.1 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | CustKey | Item | Invoice Date | Invoice_Year | Invoice_Quarter | Invoice_Month | Invoice_Day | Sales Quantity | Sales Amount | Sales Amount Based on List Price | Discount Amount | Sales Margin Amount | Sales Cost Amount | Sales Rep | U/M | List Price | Sales Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 10000481 | Urban Large Eggs | 2017-04-30 | 2017 | 2 | 4 | 30 | 1 | 237.91 | 0.000 | -237.910 | 237.91 | 0.0 | 184 | EA | 0.000 | 237.910000 |
| 1 | 1 | 10002220 | Moms Sliced Turkey | 2017-07-14 | 2017 | 3 | 7 | 14 | 1 | 456.17 | 824.960 | 368.790 | 456.17 | 0.0 | 127 | EA | 824.960 | 456.170000 |
| 2 | 2 | 10002220 | Cutting Edge Foot-Long Hot Dogs | 2017-10-17 | 2017 | 4 | 10 | 17 | 1 | 438.93 | 548.660 | 109.730 | 438.93 | 0.0 | 127 | EA | 548.660 | 438.930000 |
| 3 | 3 | 10002489 | Kiwi Lox | 2017-06-03 | 2017 | 2 | 6 | 3 | 1 | 211.75 | 0.000 | -211.750 | 211.75 | 0.0 | 160 | EA | 0.000 | 211.750000 |
| 4 | 4 | 10004516 | High Top Sweet Onion | 2017-05-27 | 2017 | 2 | 5 | 27 | 455 | 89248.66 | 185876.600 | 96627.940 | 89248.66 | 0.0 | 124 | SE | 408.520 | 196.150901 |
| 5 | 5 | 10004516 | Best Choice Fudge Brownies | 2017-05-30 | 2017 | 2 | 5 | 30 | 1 | 1950.00 | 0.000 | -1950.000 | 1950.00 | 0.0 | 124 | EA | 0.000 | 1950.000000 |
| 6 | 6 | 10007866 | Moms Sliced Turkey | 2017-09-03 | 2017 | 3 | 9 | 3 | 1 | 424.30 | 795.314 | 371.014 | 424.30 | 0.0 | 149 | EA | 795.314 | 424.300000 |
| 7 | 7 | 10009356 | Tell Tale Garlic | 2017-06-18 | 2017 | 2 | 6 | 18 | 2 | 541.92 | 1150.000 | 608.080 | 541.92 | 0.0 | 103 | EA | 575.000 | 270.960000 |
| 8 | 8 | 10009356 | High Top Walnuts | 2017-06-18 | 2017 | 2 | 6 | 18 | 15 | 353.40 | 778.200 | 424.800 | 353.40 | 0.0 | 103 | EA | 51.880 | 23.560000 |
| 9 | 9 | 10009356 | Big Time Frozen Cheese Pizza | 2017-06-18 | 2017 | 2 | 6 | 18 | 60 | 11229.00 | 24721.800 | 13492.800 | 11229.00 | 0.0 | 103 | EA | 412.030 | 187.150000 |
Last rows
| df_index | CustKey | Item | Invoice Date | Invoice_Year | Invoice_Quarter | Invoice_Month | Invoice_Day | Sales Quantity | Sales Amount | Sales Amount Based on List Price | Discount Amount | Sales Margin Amount | Sales Cost Amount | Sales Rep | U/M | List Price | Sales Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 65270 | 65272 | 10017638 | Blue Label Canned Beets | 2018-03-21 | 2018 | 1 | 3 | 21 | 2 | 671.90 | 1268.22 | 596.32 | 127.79 | 544.11 | 180 | EA | 634.11 | 335.950000 |
| 65271 | 65273 | 10017638 | Moms Sliced Turkey | 2018-03-21 | 2018 | 1 | 3 | 21 | 12 | 5244.76 | 9899.52 | 4654.76 | 2186.28 | 3058.48 | 180 | EA | 824.96 | 437.063333 |
| 65272 | 65274 | 10017638 | Gorilla Strawberry Yogurt | 2018-03-21 | 2018 | 1 | 3 | 21 | 18 | 1783.40 | 3366.18 | 1582.78 | 1015.17 | 768.23 | 180 | EA | 187.01 | 99.077778 |
| 65273 | 65275 | 10017638 | Gorilla Jack Cheese | 2018-03-21 | 2018 | 1 | 3 | 21 | 2 | 1110.30 | 2206.00 | 1095.70 | 265.75 | 844.55 | 180 | EA | 1103.00 | 555.150000 |
| 65274 | 65276 | 10017638 | Blue Label Fancy Canned Oysters | 2018-03-21 | 2018 | 1 | 3 | 21 | 40 | 312.79 | 590.40 | 277.61 | 44.39 | 268.40 | 180 | EA | 14.76 | 7.819750 |
| 65275 | 65277 | 10017638 | High Top Oranges | 2018-03-21 | 2018 | 1 | 3 | 21 | 9 | 569.90 | 1075.68 | 505.78 | 329.95 | 239.95 | 180 | EA | 119.52 | 63.322222 |
| 65276 | 65278 | 10017638 | Landslide White Sugar | 2018-03-21 | 2018 | 1 | 3 | 21 | 2 | 462.81 | 873.56 | 410.75 | 39.26 | 423.55 | 180 | EA | 436.78 | 231.405000 |
| 65277 | 65279 | 10017638 | Moms Potato Salad | 2018-03-21 | 2018 | 1 | 3 | 21 | 8 | 987.20 | 1863.36 | 876.16 | 413.20 | 574.00 | 180 | EA | 232.92 | 123.400000 |
| 65278 | 65280 | 10017638 | Better Fancy Canned Sardines | 2018-03-21 | 2018 | 1 | 3 | 21 | 36 | 27297.51 | 51524.28 | 24226.77 | 11108.61 | 16188.90 | 180 | EA | 1431.23 | 758.264167 |
| 65279 | 65281 | 10017638 | Imagine Popsicles | 2018-03-21 | 2018 | 1 | 3 | 21 | 48 | 27582.02 | 52061.28 | 24479.26 | 13347.80 | 14234.22 | 180 | EA | 1084.61 | 574.625417 |